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Lon is the first identified ATP-dependent protease highly conserved across all kingdoms. 
Model plant species Arabidopsis thaliana has a small Lon gene family of four members. 
Although these genes share common structural features, they have distinct properties in 
terms of gene expression profile, subcellular targeting and substrate recognition motifs. 
This supports the notion that their functions under different environmental conditions 
are not necessarily redundant. This article intends to unravel the biological role of Lon 
proteases in energy metabolism and plant growth through an evolutionary perspective. 
Given that plants are sessile organisms exposed to diverse environmental conditions 
and plant organelles are semi-autonomous, it is tempting to suggest that Lon genes in 
Arabidopsis are paralogs. Adaptive evolution through repetitive gene duplication events 
of a single archaic gene led to Lon genes with complementing sets of subfunctions 
providing to the organism rapid adaptability for canonical development under different 
environmental conditions. Lon1 function is adequately characterized being involved in 
mitochondrial biogenesis, modulating carbon metabolism, oxidative phosphorylation and 
energy supply, all prerequisites for seed germination and seedling establishment. Lon is not 
a stand-alone proteolytic machine in plant organelles. Lon in association with other nuclear- 
encoded ATP-dependent proteases builds up an elegant nevertheless, tight interconnected 
circuit. This circuitry channels properly and accurately, proteostasis and protein quality 
control among the distinct subcellular compartments namely mitochondria, chloroplasts, 
and peroxisomes. 
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NO GIFT WITHOUT A PRICE: LIFE IN AN AEROBIC WORLD IS 
NOT NECESSARILY PERFECT 

The paradox of aerobic life or the "Oxygen Paradox" argues that 
organisms do not survive in oxygen depleted environments, yet 
oxygen is inherently dangerous to their existence. This "dark side" 
of oxygen is attributed to the damage of biomolecules (Davies, 
1995). Life in an oxygenated environment contributed to the 
evolution of aerobic metabolic processes such as respiration and 
photosynthesis that unavoidably result in the production of molec- 
ular oxygen metabolites known as reactive oxygen species (ROS). 
Although increasing evidence indicates that ROS in plants could 
function as signaling molecules in regulating development and 
pathogen defense response, ROS have the capacity to stochasti- 
cally cause oxidative damage to proteins, DNA, and lipids (Apel 
and Hirt, 2004; Moller et al„ 2007). 

Mitochondria, chloroplasts, and peroxisomes represent sub- 
cellular sources for ROS production and the principle targets 
for oxidative macromolecular damage. In particular, the elec- 
tron transport chain of mitochondria transfers high energy 
electrons to oxygen through a series of inner membrane pro- 
tein complexes. This process of electron transfer from NADH 
or FADH 2 to O2 by the electron carriers, known as oxidative 
phosphorylation (OXPHOS), is leading to energy production 
in the form of ATP. However, through this process leakage of 



electron occurs, ultimately generating highly reactive species, 
causing severe cell damage. This side-effect in mitochondria 
led to the "free-radical theory," conceived in 1956, speculating 
that endogenous oxygen radicals were generated within cells and 
resulted in a pattern of cumulative damage (Harman, 1956). 
Nowadays, this theory is widely appreciated by an increas- 
ing number of scientists from an expanding circle of fields, 
including plant biologists, supporting the role of oxidants in cel- 
lular damage (Beckman and Ames, 1998; Finkel and Holbrook, 
2000). 

To cope with the hostile oxygenated environment, organ- 
isms have evolved sophisticated networks of defense (Finkel 
and Holbrook, 2000; Apel and Hirt, 2004; Friguet etal., 2008). 
The first line of defense against oxidative injury is composed 
of a complex array of ROS detoxifying enzymes and non- 
enzymatic antioxidants that counteract and regulate the over- 
all ROS levels, maintaining physiological homeostasis. During 
physiological steady state conditions the cellular oxidants are 
efficiently scavenged by these antioxidative defense components 
that are often confined to particular compartments. However, 
under persisting adverse oxidative conditions the equilibrium 
between ROS production and scavenging is perturbed resulting 
in rapid intracellular accumulation of oxidants. These distur- 
bances characterized as oxidative stress, induce modifications to 
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both the polypeptide backbone and amino acid side chains of 
proteins. 

As plants are sessile organisms exposed to harsh environmen- 
tal conditions, numerous abiotic conditions result in protein 
misfolding usually caused by ROS-mediated chemical modifica- 
tions. These conditions include exposure to high light intensity, 
drought stress, low or high temperature and mechanical stress 
(Apel and Hirt, 2004; Moller etal, 2007). The misfolded pro- 
teins are particularly prone to oxidation (Dukan etal, 2000) 
leading to the formation of adducts that often bring in carbonyl 
groups and cross-links (Friguet etal, 2008). The carbonylated 
proteins are functionally impaired or completely inactive, cre- 
ating toxic protein aggregates and cross-linked inclusion bod- 
ies that interfere with normal cellular function (Petropoulos 
and Friguet, 2006). Hence, the second line of defense against 
oxidative injury is composed of the protein quality control 
mechanisms that essentially ensure the proper level of func- 
tional proteins within the cell and eliminate non-functional 
proteins. 

The ATP-dependent Lon protease is a key component of pro- 
tein quality control highly conserved across the kingdoms of 
living organisms. This article presents important findings and 
the progress recently made in plants, whereas special emphasis 
is simultaneously given on major scientific breakthroughs regard- 
ing the Lon function in non-plant organisms. This comparative 
approach will contribute toward better understanding of Lon in 
organellar proteostasis and cellular homeostasis. 

THE AAA + Lon PROTEASE IS A MAJOR COMPONENT OF 
PROTEIN QUALITY CONTROL MECHANISM 

Protein fate depends on an elegant protein quality control sys- 
tem that precisely orchestrates protein complex assembly and 
degradation, thereby safeguarding cellular homeostasis especially 
under stress conditions. The role of protein quality control is 
biphasic, as it is composed of energy-dependent repair molec- 
ular chaperones and degradation machines. Chaperones and 
proteases represent two sides of the same coin, acting in oppos- 
ing pathways to clear unfolded proteins from the cell (Voos, 
2013). The molecular chaperones within the cell facilitate the 
folding of newly synthesized proteins into their native con- 
formations, prevent aggregation and assist in the assembly of 
multiprotein complexes. Conversely, ATP-dependent proteases 
degrade irreparably damaged or improperly synthesized pro- 
teins. In the cytosol and nucleus of higher eukaryotes, the 
proteins to be removed are ubiquitylated and delivered to the 
26S proteasome for degradation (Hershko and Ciechanover, 
1998). The 26S proteasome is the most elaborate AAA+ protease 
(ATPases associated with diverse cellular activities), consisting 
of a 20S protease core particle and two 19S regulatory caps 
modulating several aspects of plant development (Coux etal., 
1996; Voges etal, 1999; Smalle and Vierstra, 2004). In con- 
trast to these cellular compartments and as a legacy of their 
endosymbiotic heritage, eukaryotic organelles maintain indepen- 
dent AAA + protein degradation machineries categorized into 
the soluble Lon and Clp (caseinolytic protease) families and the 
membrane-integrated FtsH-class (filament-forming temperature- 
sensitive) proteases (also called as AAA-proteases; Adam etal., 



2001; Sinvany-Villalobo etal, 2004; Sakamoto, 2006; Rigas etal., 
2012; Janska etal., 2013). In the case of FtsH and Lon, the 
ATPase and proteolytic domains are formed by a single polypep- 
tide, whereas in Clp these domains are separate proteolytic (ClpP) 
and chaperone-like (ClpX) subunits. 

Protease La encoded by the Lon gene homolog in Escherichia 
coli, was the first discovered AAA+ protease (Chung and Gold- 
berg, 1981). As "La" is the sixth musical note of the solfege syllable, 
the nomenclature describes the order of Lon elution from the 
chromatographic analysis of E. coli soluble proteolytic enzymes 
(Swamy and Goldberg, 1981). Lon is an ubiquitous proteolytic 
machine present in unicellular and multicellular organisms. The 
Lon protease consists of three functional domains (Rotanova et al., 
2006; Rigas et al., 2012). The long N-terminal domain that possibly 
together with the central AAA+ module selectively interact with 
target proteins and the C-terminal proteolytic domain (P-domain) 
with a typical Serine-Lysine catalytic dyad at the active center 
(Botos etal., 2004). In plants, the N-domain and the P-domain 
exhibit the highest evolutionary conservation. On the contrary, 
the AAA+ module that includes the Walker Box A and B motifs 
shows the highest degree of divergence in both amino acid com- 
position and length, and is responsible for binding and hydrolysis 
of ATP (Rigas etal., 2009b). The orthologs of Lon are divided 
into two subgroups (Rotanova etal., 2006; Rigas etal., 2012): 
A type (A-Lons), which have a large multi-lobed N-terminal 
domain together with the ATPase and protease domains, and B 
type (B-Lons), which instead of the N domain have a membrane- 
anchoring region emerging from the ATPase domain. B-Lons 
are exclusively present in Archaea that lack FtsH and the Clp 
proteases and thereby B-Lons are the major ATP-dependent prote- 
olytic machines in those cells. The soluble A-Lons are found in all 
bacteria and in eukaryotic cell organelles, such as mitochondria, 
chloroplasts, and peroxisomes (Lingard and Bartel, 2009; Rigas 
etal, 2009a,b, 2012). In the yeast Saccharomyces cerevisiae, Piml 
(proteolysis in mitochondria) the homologous Lon protease has 
a major role in mitochondrial proteostasis as this organism lacks 
Clp (Venkatesh et al, 2012). 

As a chambered protease, the 26S proteasome degrades pro- 
tein substrates that carry multiple ubiquitin moieties (Hershko 
and Ciechanover, 1998). Given that mitochondria do not exhibit 
a covalent tagging system for damaged proteins like the ubiq- 
uitin tag in the cytosol or nucleus, substrate selectivity of Lon 
ATP-dependent protease most likely depends on the intrinsic 
characteristics of the polypeptide to be degraded. Lon preferen- 
tially degrades to completion damaged or misfolded polypeptides 
having a 50-60 amino acid long unstructured and exposed 
protein segment with compromised conformational state (von 
Janowsky et al., 2005) . Upon protein misfolding, specific sequences 
rich in aromatic and hydrophobic residues become accessible to 
be recognized by Lon (Gur and Sauer, 2008). Moreover, Lon 
can also degrade folded unassembled polypeptides that can be 
accommodated into the proteolytic central channel with surfaced- 
exposed hydrophobic residues located within a highly charged 
environment (Ondrovicova et al., 2005). Consequently, Lon selec- 
tively degrades untagged non-natively folded substrates or folded 
but unassembled subunits, ultimately protecting the functional 
integrity of the organellar proteome. 
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EXPRESSION AND PROTEIN TARGETING OF Arabidopsis Lon 
GENES 

The protein isoforms of Lon are encoded by small nuclear 
gene families and predicted to be targeted to distinct subcel- 
lular organelles. In Arabidopsis, four nuclear genes have been 
identified that potentially encode for members of the Lon fam- 
ily (Sinvany-Villalobo etal, 2004; Janska etal., 2010; Rigas etal., 
2012). On the basis of publicly available microarray data in the 
Genevestigator database and scientific reports (Rigas etal., 2009a) 
Lon genes in Arabidopsis are expressed in various cell types and 
tissues. Nevertheless, the Lon genes show distinct expression pro- 
files (Figure 1). The expression of Lonl (At5g26860) is high in 
rapidly growing organs of embryonic origin and in high-energy 
dependent tissues, which have increased mitochondria population 
per cell to sustain increased energy requirements. Lonl is pre- 
dominantly expressed in germinating seeds, embryonic organs, 
including cotyledons and primary roots, and in organs with high 
growth rates such as developing inflorescences, while it was hardly 
detected in mature roots or stems of adult plants (Figure 1; Rigas 
etal., 2009a). In comparison to Lonl, Lon4 (At3g05790) shows 
the lowest level of expression, albeit Lon4 gene response is qual- 
itatively similar to Lonl. Among the members of the Lon gene 
family, Lon2 (At5g47040) is highly expressed, while gene expres- 
sion generally remains constant among the examined cell types and 
tissues without significant fluctuations. Due to the lack of exper- 
imental evidence to report the presence of gene transcripts, Lon3 
(At3g05780) is presumed to be a pseudogene (Ostersetzer etal., 
2007; Rigas etal., 2009a). However, microarray data strongly sup- 
port that Lon3 expression dominates in sperm cells. This specific 



Lon3 expression profile implies a potential role in plant repro- 
duction and particularly in male gametes maturation and double 
fertilization. Apart from the sperm cells, the marginal level of Lon3 
expression detected in other tissues most likely represents experi- 
mental noise impossible to be filtered as Lon3 and Lon4 are almost 
identical. 

Most of the nuclear-encoded proteins are specifically targeted 
to a single organelle. However, dual-targeting of proteins to mito- 
chondria and chloroplasts has been surprisingly frequent due to 
their post-endosymbiotic evolution (Millar etal., 2006; Baudisch 
et al, 2014). Two types of dual-targeting configurations have been 
reported in plants: twin and ambiguous presequences (Peeters and 
Small, 2001; Silva-Filho, 2003; Mackenzie, 2005; Baudisch etal, 
2014). The ambiguous presequence generates a single protein iso- 
form with a targeting peptide recognized by the import apparatus 
of both mitochondria and chloroplasts. Although this configura- 
tion can be organized in domains determining targeting specificity 
to an individual organelle, the signals responsible for organellar 
targeting most frequently overlap (Berglund et al, 2009a). Hence, 
ambiguous presequences cannot be completely distinguished from 
organelle-specific targeting peptides and they have an intermedi- 
ate amino acid composition using the same organellar import 
pathways as the organelle-specific proteins. Despite the fact that 
the determinants for dual-targeting are not fully understood, the 
physicochemical properties within the N-terminal of the ambigu- 
ous presequences including hydrophobicity, the charge of amino 
acids and secondary structure, modulate the double localization 
(Berglund etal, 2009b; Ge etal., 2014). The twin presequences 
include two distinct targeting domains arranged in tandem at 
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FIGURE 1 | Comparative analysis of the Arabidopsis Lon gene expression profiles in various cell types and tissues. Gene expression data were obtained 
from the Genevestigator server (Hruz etal., 2008). The vertical axis uses a base 10 logarithmic scale. 
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the N-terminus. In eukaryotes, twin presequences can confer 
dual-targeting to distinct subcellular compartments by employing 
two alternative in-frame translational initiation codons (Danpure, 
1995; Silva-Filho, 2003; Carrie et al„ 2009; Carrie and Small, 2013). 
Both ambiguous and twin presequences amplify the number of 
protein isoforms in subcellular compartments without affecting 
genome size. The majority of dual-targeted proteins in plants con- 
tain an ambiguous presequence showing an overall prevalence over 
twin presequences (Carrie etal., 2009). 

The protein isoforms encoded by the nuclear Lon genes in 
Arabidopsis are scattered to plant cell organelles mainly involved 
in energy metabolism by utilizing different mechanisms of pro- 
tein translocation (Figure 2). Lon4 is dual-targeted to mito- 
chondria and chloroplasts displaying an ambiguous presequence 
(Sakamoto, 2006; Ostersetzer etal., 2007). The C-terminus of 
Lon2 bears a type 1 peroxisome-targeting signal (PTS1) conferring 
protein localization in peroxisomes (Lingard and Bartel, 2009). 
Computational analysis of Lon3 N-terminal domain identified a 
potential ambiguous presequence for dual-organellar localization 
to chloroplasts and mitochondria. The Lonl dual-targeting is reg- 
ulated both at the transcriptional and translational level (Daras 
etal, 2014). 

THE SUBSTRATE RECOGNITION MOTIFS ARE HIGHLY 
VARIABLE AMONG THE Arabidopsis LON PROTEIN 
ISOFORMS 

The AAA+ Lon protease is a soluble single-ringed multimeric 
holoenzyme. Adjacent to the AAA + module is the sensor- and 



substrate-discrimination (SSD) domain mainly involved in mod- 
ulating selective substrate recognition by Lon so as the target 
protein to be degraded. In line with its highly selective mode 
of action, the SSD domain exhibits substantial interspecies and 
within the same species diversity (Rigas etal., 2009b). The yeast 
S. cerevisiae Lon purified from mitochondria is a ring-shaped 
complex with seven flexible subunits as determined by ana- 
lytic ultracentrifugation and cryoelectron microscopy (Stahlberg 
etal., 1999). Subunits of E. coli Lon are known to assemble 
into ring-shaped homohexamers that enclose an internal degra- 
dation chamber. These hexamers may also interact to form a 
dodecamer at physiological protein concentrations (Vieux etal., 
2013). Insights may also be gained from the structure of an intact, 
assembled Lon protease from the hyperthermophilic archaea Ther- 
mococcus onnurineus (TonLon) that is ubiquitously present in 
various deep-sea hydrothermal vent systems. TbnLon is a 635- 
residue protein belonging to the B-Lon family having the protease 
domains arranged with a near perfect sixfold symmetry rela- 
tive to the axial pore (Cha etal., 2010). This crystal structure 
suggests that the P-domains of each subunit form a bowl-like 
chamber with a lid formed by the AAA + domains, such that 
substrates and degradation products may enter and exit the prote- 
olytic chamber via opposing axial pores. Likewise, the homology 
model of human Lon suggests an hexameric complex formation 
that has an asymmetric, open-ring arrangement reminiscent of 
yeast Lon, albeit the yeast Lon fails to be modeled as a hex- 
amer (Venkatesh etal., 2012). The structural features of Lon 
proteases in line with homology modeling provide conclusive 
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FIGURE 2 | Arabidopsis Lon protein isoforms are present in the Lon4 are dual-targeted to mitochondria and chloroplasts due to twin and 

subcellular compartments involved in energy metabolism. The majority of ambiguous presequences, respectively, at the N-terminus of the encoded 

nuclear-encoded organellar proteins are translated on cytosolic ribosomes. In protein isoforms. Lon2 is imported into peroxisomes by a canonical 

a distinct post-translational process, they have to be transported to their final C-terminal type 1 peroxisome-targeting signal (PTS1 ). Lon3 subcellular 

destination in the organelle directed by specific targeting signals. Lon1 and localization remains elusive as yet. 
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evidence that distinguishes bacterial and human Lon proteases 
as hexameric complexes from yeast Lon, which is uniquely 
heptameric. 

As the SSD domain is the most variable domain among the 
Arabidopsis Lon proteases, the architectural features of protein 
monomers were analyzed by molecular modeling. These ribbon 
models were in turn compared with the hexameric complexes of 
bacterial (£cLon) and human (HsLon) Lon proteases and with 
the heptameric complex of yeast Piml. Homology modeling con- 
firmed that iicLon and HsLon share the same structural features 
but differ from the heptameric Piml complex (Figure 3A). As 
reported by Venkatesh etal. (2012), this is most likely explained 
by the primary amino acid sequences of _BcLon and HsLon, 
which are significantly shorter than Piml and Arabidopsis Lon 
sequences (Rigas etal, 2009b). Interestingly, the analysis revealed 
that structurally the Arabidopsis Lon proteases deviate from the 
hexameric EcLon and HsLon proteases fitting best with the 
heptameric yeast structure and due to distinct structural fea- 
tures they are classified into two groups (Figure 3A). The first 
group includes AfLonl and AfLon3 that share similar structural 
properties with Piml, all preserving a single pair of parallel 
^-sheets (depicted in red color) that is typical of £cLon and 
HsLon. AfLon2 and AtLon4 belong in the second group bear- 
ing between the a-helices (depicted in gray color) an internal 
loop polypeptide configuration (depicted in yellow color) with 
different secondary structure (depicted in purple color) com- 
pared to the members of the first group. Additionally, a surprising 
core structure is discovered in Piml and Arabidopsis Lon pro- 
teases likely originating from the hexameric .EcLon and HsLon 
complexes (Figure 3B). The alignment between the models of 



AfLonl or AfLon4 with the core structure of HsLon show that 
AfLonl internal geometry differs from AfLon4. The internal loop 
domain of AfLon4 shows a right-handed extension, whereas in 
AfLonl is left-handed. This structural difference between the 
two major representative proteases of Arabidopsis suggests that 
Lonl and Lon4 are gene paralogs performing specialized functions 
without being necessarily redundant. However, the possibility of 
recognizing similar protein targets under adverse environmental 
conditions that considerably modify the internal milieu of mito- 
chondria and chloroplasts cannot be excluded. Future studies 
are required to assess the homology models of Arabidopsis Lon 
proteases and to solve the crystal structure of the holoenzyme, 
providing insights on Lon structural dynamics and functional 
versatility. 

Lonl AND Lon4 PARALOGS EVOLVED DISTINCT STRUCTURAL 
AND FUNCTIONAL FEATURES 

Ancient invasions by eubacteria gave rise through symbiosis 
to mitochondria and chloroplasts that have enormous impact 
on bioenergetic and metabolic homeostasis of plants (Dyall 
etal, 2004). Mitochondria originated first from an endosymbi- 
otic event of a-proteobacterial fusion. A second cyanobacterial 
invasion supplied the plant cell with the present-day chloro- 
plast capable for photosynthesis. During the endosymbiotic 
process, the symbionts lost their autonomy by massive trans- 
fer of their genetic information to the host nucleus resulting 
in genetic redundancy. The evolution and establishment of the 
protein translocation machinery caused bulk gene loss leading to 
organellar genome reduction. The translocation process involved 
N-terminal extensions of the nuclear-encoded precursor protein 





FIGURE 3 | Molecular modeling provides insights into the structural 
features of Arabidopsis Lon proteases. (A) Monomer structure comparison 
of the ribbon model of the sensor- and substrate-discrimination (SSD) domain 
discriminates the hexameric bacterial and human Lon complexes 
encompassed by red eclipse from possibly heptameric complexes of Piml 
and Arabidopsis Lon homologs. On the basis of discrete structural features 
the Arabidopsis proteases could be further categorized into two groups 
encompassed by green and yellow eclipses. The Lon protein accessions and 
the coordinates of the SSD domains given in parentheses are AfLonl : 
NP_568490 (603-739), AtLon2: NP_568675 (547-784), AfLon3: NP_566258 



(586-726), and AfLon4: NP_566259 (585-733) from Arabidopsis thaliana, the 
Homo sapiens Lon: NP_004784 (662-747), Piml: P36775 (772-911) from 
Saccharomyces cerevisiae and EcLon: AAC36871 (494-580) from Escherichia 
coli. Modeling of the SSD domain was performed on the basis of known 
crystallographic data mainly available from AAA+ proteins and bacterial Lon 
proteases, which were automatically detected by the Phyre2 Protein Fold 
Recognition Server (www.sbg.bio.ic.ac.uk/phyre2). The ribbon model was 
generated in PyMol (www.pymol.org). (B) Homology modeling distinguishes 
AfLonl (green) protease from AfLon4 (yellow), albeit both preserve the core 
structure of the hexameric human Lon (red) homolog. 
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synthesized on cytoplasmic ribosomes. Coordinated evolution of 
protein import machineries from ancient symbionts led to dual- 
targeting of nuclear-encoded proteins to both mitochondria and 
chloroplasts. 

Contrary to Lonl dual-targeting that is attributed to twin pre- 
sequences, an ambiguous presequence confers Lon4 dual-targeting 
specificity (Sakamoto, 2006; Ostersetzer etal., 2007). Besides the 
annotated initiation codon, surprisingly an additional upstream 
AUG is present in Lon4 at the same place as the first initia- 
tion codon of Lonl. However, the reading frame between the 
first and second initiation codons of Lon4 is interrupted by a 
single thymine base insertion that results in a TGA stop codon 
(Figure 4). Upon removal of this base the reading frame becomes 
open encoding an N-terminal extension conferring Lon4 targeting 
specificity to chloroplasts similarly to the N-terminal extension 
of Lonl. Moreover, Lonl and Lon4 are remarkably similar in 
terms of amino acid identity and similarity of the structural 
domains besides the SSD (Rigas etal., 2009b). As the structure 
of the SSD domain is tightly associated with Lon proteolytic 
activity (Figure 3), Lonl and Lon4 gene duplicates were at the 
molecular level preserved through adaptive evolution with com- 
plementing sets of subfunctions (Lynch and Conery, 2000). The 
process of subfunctionalization provides an adaptive advantage 
by permitting a dynamic model of gene regulation so that each 
daughter protein performs a specialized function with greater 
precision than the ancestor. Taken together, these observations 



support the notion that Lonl and Lon4 are gene paralogs that 
evolved distinct mechanisms for dual-targeting and subsets of 
function. 

The features of the two paralogs support the existence of an 
archaic Lon gene originated from the first bacterial endosymbiotic 
event that duplicated leading to the ancestral Lonl and Lon4 genes 
with twin-presequence structures for dual-organellar targeting 
(Figure 4). This duplication event most likely occurred upon land 
plant evolution over 400 million years ago. Both ancestral paralogs 
evolved through adaptive evolution discrete SSD domains and 
thereby specialized roles in plant development and stress-response. 
Moreover, the ancestral Lon4 gene evolved the mitochondrial pre- 
sequence into an ambiguous one. This evolutionary step was 
characterized by a single thymine insertion interrupting the read- 
ing frame of Lon4 to prevent the synthesis of the chloroplast transit 
peptide. This evolutionary process drifts protein dual-targeting 
from twin presequences to the ambiguous one. Considering that 
the cases of protein dual-targeting by ambiguous presequence pre- 
vail over twin presequences (Carrie et al, 2009), the dual-targeting 
driven by twin presequences plausibly represents an evolutionary 
fossil. In line with this model, additional duplication events of 
the archaic or the ancestral Lon genes likely occurred generat- 
ing the present-day Lon3 gene that is in close proximity to Lon4 
in a head-to-tail orientation and Lon2. While Lon2 acquired a 
peroxisome-targeting signal, the mitochondrial presequence dete- 
riorated. This series of duplication events does not exclude other 
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FIGURE 4 | Model for the evolution of Arabidopsis Lonl and Lon4 gene 
paralogs. The Lon gene in plant organelles is of bacterial origin, most likely 
introduced upon the first endosymbiotic invasion. This archaic Lon gene was 
first duplicated to the ancestral Lonl and Lon4 genes that evolved 
dual-organellar protein translocation properties by acquiring twin N-terminal 
presequences. The two ancestral paralogs were further diverged to the 



present-day Lonl and Lon4 genes with discrete targeting mechanisms and 
SSD domains. A single base insertion between the two AUGs of the ancestral 
Lon4 was the impetus to evolve an ambiguous presequence for 
dual-organellar targeting. The variation of the SSD domain between Lonl and 
Lon4 is associated with complementing sets of subfunctions allowing, 
through adaptive evolution, the maintenance of both functional paralogs. 
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evolutionary pathways resulting in quadruple Lons in Arabidop- 
sis genome. Nevertheless, the features of Arabidopsis Lon genes 
that determine protein isoform translocation in plant organelles 
together with the structure of the SSD domains argue in favor of 
the proposed model. 

Lonl-DEPENDENT MITOCHONDRIAL BIOGENESIS IS 
ASSOCIATED WITH OXPHOS CAPACITY 

Seed germination and seedling establishment depend on the 
assembly or biogenesis of mitochondria and the mobilization 
of storage reserves. In oilseed species like Arabidopsis, seedling 
establishment is supported by soluble sugars that are gener- 
ated by storage oil mobilization. The mobilization of storage 
oil to sucrose involves main biochemical pathways compartmen- 
talized into distinct organelles. The triacylglycerols contained 
in oil bodies are hydrolyzed to free fatty acids (FFAs). The 
FFAs are imported into the peroxisome entering the reactions 
of (5-oxidation and the glyoxylate cycle. Seedling establish- 
ment additionally depends on the mitochondrial tricarboxylic 
acid (TCA) cycle and on gluconeogenesis that operates in the 
cytosol. 

Molecular genetics revealed that Lonl is involved in the bio- 
genesis and maintenance of mitochondrial function to ensure 
the proper operation of such biochemical network. Transmis- 
sion electron microscopy studies of lonl mutants revealed the 
presence of mitochondria with abnormal morphology. The lonl 
mitochondria are swollen, having a poorly developed internal 
membrane system composed of few discernible cristae (Rigas et al., 
2009a). These ultrastructural features of lonl mitochondria are 
reminiscent of the pro-mitochondrial morphology of dry seeds, 
supporting the role of Arabidopsis Lonl protease in mitochondrial 
biogenesis during germination. Likewise, electron microscopy 
performed on Lon-deficient mitochondria of yeast (Suzuki etal., 
1994) and human (Bota et al., 2005) cells revealed aberrant mito- 
chondrial morphology with electron-dense inclusion bodies in 
the mitochondrial matrix most likely representing oxidatively 
modified and aggregated proteins. These severe phenotypes of 
Lon deficiency across eukaryotes demonstrate the importance 
of this proteolytic machine to maintain proper mitochondrial 
function. As the main mitochondrial electron transport chain 
consists of coupled respiratory chain complexes found in the 
inner mitochondrial membrane, the OXPHOS capacity of lonl 
mitochondria is expected to be impaired due to the abnor- 
mal mitochondrial morphology. Mitochondria isolated from 
lonl mutants showed reduced respiratory capacity when oxi- 
dizing succinate and cytochrome c via decreased activity of 
complexes II and IV, respectively (Rigas etal., 2009a). Addition- 
ally, in the absence of Lonl the activities of at least five TCA 
cycle enzymes were significantly decreased. Analysis of the mito- 
chondrial proteome revealed that complex I was additionally 
affected in lonl mutants (Solheim etal., 2012). Taken together, 
these results support the notion that Lon protease sustains the 
activity of major OXPHOS complexes during germination in 
Arabidopsis. 

Despite primary metabolism and energy supply through 
OXPHOS, mitochondria also play a crucial role in cell signaling 
and communication. In mammalian cells, Lon protease under 



hypoxic conditions optimizes the activity of the electron trans- 
port chain by modulating the equilibrium between cytochrome 
c oxidase (COX; complex IV) subunits COX4-1 and COX4-2 
(Fukuda etal., 2007). Under reduced O2 availability, the hypoxia- 
inducible transcription factor HIF-la binds to hypoxia response 
elements (HRE) of Lon gene promoter leading to the induc- 
tion of Lon expression for the degradation of COX4-1. At the 
same time, HIF-la up-regulates an alternate isoform-COX4-2, 
which is assembled into the COX complex replacing COX4-1. In 
hypoxic cells, the COX4-2 containing complexes are better opti- 
mized for transporting electrons and increasing the efficiency of 
respiration. Additionally, Lon expression is enhanced in vitro by 
hypoxia or under endoplasmic reticulum (ER) stress and in vivo 
by brain ischemia (Hori et al, 2002). Under hypoxia or ER stress, 
a novel signaling pathway from ER to mitochondria disturbs the 
expression and assembly of COX, whereas the expression of Lon 
protects the mitochondria from unassembled complexes. Intrigu- 
ingly, Lon was recently reported to be implicated into the cellular 
homeostasis of the bZip transcription factor ATFS-1 (activating 
transcription factor associated with stress- 1) that is required 
for the mitochondrial unfolded protein response (UPR mt ) cas- 
cade (Nargund et al, 2012). During mitochondrial stress, ATFS-1 
accumulates in the nucleus and activates the UPR mt as ATFS-1 
import in mitochondria is inhibited due to reduced mitochon- 
drial import efficiency by the localized in the inner mitochondrial 
membrane ATP-binding cassette transporter HAF-1 [half trans- 
porter (P-glycoprotein related)]. In healthy cells, the UPR mt is 
not activated as ATFS-1 is compartmentalized away from the 
nucleus efficiently imported in mitochondria matrix, where is 
rapidly degraded by the Lon protease. Consequently, mitochon- 
drial homeostasis is maintained by the conditional-dependent 
translocation of a transcription activator between the nucleus, 
where it activates the stress response cascade, and mitochondria 
where it is removed by Lon protease. 

CONCLUSIONS AND FUTURE PERSPECTIVES 

Protein misfolding and degradation, especially in mitochondria 
which are the main source for oxidants in the cell, are processes 
that determine protein fate causing mitochondrial dysfunction. 
Mitochondrial dysfunction has now been implicated in aging, can- 
cer and in a variety of age-related degenerative diseases. Lon, in 
association with other AAA+ proteases, modulate protein quality 
control, constitutive metabolism and adaptive responses to cellular 
or environmental stress. Our understanding of the physiological 
role of Lon proteases in plants is still evolving, although great 
advancement is made in non-plant species. However, contrary 
to the bacterial, yeast and mammalian counterparts, Arabidop- 
sis has a genetic pluralism in terms of Lon gene copies within 
the nuclear genome. This could be attributed to the presence of 
an additional organelle in plants, the chloroplast, and to the fact 
that plants are sessile organisms exposed to extreme environmen- 
tal conditions. The Arabidopsis Lon genes could be considered 
paralogs that evolved distinct structural and functional features 
including gene regulation and expression, subcellular targeting 
localization and substrate recognition mechanisms. Moreover, 
Arabidopsis has the genetic and molecular tools to contribute 
toward better understanding of the functional role of Lon as key 



www.f rontiersin .org 



April 2014 | Volume 5 | Article 145 | 7 



Rigas etal. 



Lon proteases in Arabidopsis energy metabolism 



controller of proteostasis in organelles and in response to intrinsic 
or environmental cues. These Arabidopsis paralogs could be proven 
valuable assets to unravel the substrate recognition mechanisms 
and organelle-to-nucleus communication circuits. This knowl- 
edge might be of use to precisely comprehend the role of Lon in 
non-plant species including humans and thereby to improve life 
quality and expectancy. 
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